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Abstract 

We present a new decision rule, maximin safety, that seeks to maintain 
a large margin from the worst outcome, in much the same way minimax 
regret seeks to minimize distance from the best. We argue that maximin 
safety is valuable both descriptively and normatively. Descriptively, max¬ 
imin safety explains the well-known decoy effect, in which the introduc¬ 
tion of a dominated option changes preferences among the other options. 
Normatively, we provide an axiomatization that characterizes preferences 
induced by maximin safety, and show that maximin safety shares much 
of the same behavioral basis with minimax regret. 


1 Introduction 

Representing uncertainty using a probability distribution, and making decisions 
by maximizing expected utility, is widely accepted, founded on formal math¬ 
ematical principles, and satisfies intuitive notions of rationality such as inde¬ 
pendence of irrelevant alternatives and the sure thing principle [20) . However, 
enforcing seemingly appealing concepts of rationality can ultimately lead to de¬ 
cisions inconsistent with what real humans consider reasonable. For example, 
observed behavior under unquantified (Knightian [TJ]/ strict m) uncertainty, 
such as that in the Ellsberg paradox [8], demonstrates how appealing concepts 
of rationality can lead to inconsistency with human choices. Alternative de¬ 
cision rules, such as maximin utility [25] and minimax regret nnum provide 
rationally plausible decisions in ambiguous situations and can be used to resolve 
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such paradoxes, but still fail to explain some human behavioral patterns. A par¬ 
ticularly illustrative example of such behavior is called the decoy effect [la , in 
which the introduction of a dominated option changes the preference among the 
undominated ones. While the decoy effect has been investigated in the psychol¬ 
ogy [3 US] and economics literature Elisa, we are unaware of any axiomatic 
treatment of it. To address this, we introduce a criterion called safety as the 
basis for a maximin safety decision rule. 0 Safety serves as a dual to regret that 
quantifies distance from a worst outcome, much as regret quantifies proximity 
to a best outcome. Maximin safety also satisfies familiar properties common to 
maximin utility and minimax regret, and hence also resolves the Ellsberg para¬ 
dox. Moreover, maximin safety accommodates observed preferences that are 
incompatible with minimax regret and maximin utility. We demonstrate how 
safety-seeking behavior can produce the decoy effect, and show how maximin 
safety can explain it. We also extend Stoye’s [23] axiomatizations of standard 
decision rules to include maximin safety, thus allowing a comparison between 
maximin safety and state-of-the-art decision rules. 

1.1 Relative Preferences and Regret 

It is not hard to imagine situations in which performance relative to other 
possible outcomes is more important than absolute performance. Consider, for 
example, a group of duck hunters surprised by a hungry bear Eli]. The hunters 
all attempt to escape by running in the same direction while the slowest one 
despairs: “this is hopeless, we can never outrun the bear.” The hunter in front 
of him snickers, “I don’t need to outrun the bear, I just need to outrun youJ' 
Whether the prospect is being picked from a group of peers for a date [2] , win¬ 
ning a gold medal, or obtaining an ‘A’ in a class, success is often measured by 
relative performance, rather than by an absolute standard. One such preference 
for relative performance is embodied in the well-known decision theoretical con¬ 
cept of regret uniin]. While psychological literature on regret focuses on the 
bad feelings that occur after a choice leads to an inferior outcome, some also 
considers that anticipation of such negative emotions may influence the choice 
itself Emmu]. 

In this paper, we assume that uncertainty is captured by a set of possible 
worlds, one of which is the true state of the world. Regret is a measure of 
distance between the value of a considered outcome and the value of the best 
possible outcome, under a given state. This leads to an important property 
that is always true for regret - the introduction of a dominated option does 
not change the regrets of the existing options. We will refer to this property 
as independence of dominated alternatives (IDA). Those who believe in regret 
avoidance may think that this property is perfectly reasonable. For example, 
suppose you have a $10 bill and you can either buy a $10 lottery ticket, or two 
$5 lottery tickets. Most would agree that your choice should not be affected 

^This decision rule has been mentioned in passing, inside a proof by Hayashi m, where it 
was referred to as ‘maximin joy’. We use the term ‘safety’ rather than ‘joy’ to avoid confusion 
with the concept called ‘joy of winning’ in m- 
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by a dominated third option, “burning the $10 bill”. Other standard decision 
rules, such as expected utility maximization, have even stronger independence 
guarantees. The ranking of two choices under expected utility maximization is 
menu-independent^ i.e., completely independent of the set of feasible choices (the 
menu). Menu-independence implies IDA. In contrast, regret-based preferences 
are menu-dependent, but since they conform to IDA, they are not compati¬ 
ble with observed biases sensitive to dominated options [2]. While IDA seems 
intuitively appealing, there is a great deal of empirical evidence that human pref¬ 
erences are indeed affected by dominated options in measurable and sometimes 
profound ways. 

1.2 The Decoy Effect in Decision Theory 

Suppose you are offered $6 in cash, and the option of trading it for a Cross pen. 
The pen is nice, but you have plenty of pens, so decide to keep the cash. Right 
before you walk away, you are offered an alternative pen in exchange for the $6. 
You see the new pen and find it hideous. A smile comes to your face as you 
turn around and say, “you know. I’ll take that original Cross pen after all.” 

This story dramatizes an actual experiment [T3]. When the first choice was 
offered to 106 people, 64% took the cash, 36% took the pen. When the second 
pen was added to the offer to 115 other subjects, 52% took the cash, 46% took 
the Cross pen, and 2% took the decoy. Generally, a decoy is an option that is 
designed to be inferior to another option in every way (i.e., it is a dominated 
option). Despite the intuitive appeal of IDA, the presence of a dominated option 
drove selection of the Cross pen from 36% to 46%. In this paper, we focus on 
a particular class of decoy effect, called asymmetric dominance, which occurs 
when the decoy is dominated by one existing alternative, but not by another. 
Empirical studies show that the decoy is rarely chosen, but its addition to a set 
of choices consistently drives decision makers toward the dominating choice. 

Numerous empirical studies have also shown decoy effects in class action 
settlements m, recreational land management [J, choice of healthcare plans 
and political candidates [12], purchase of consumer goods such as cameras and 
personal computers [22] . restaurant choices [13] . and even romantic attraction 
|5]. Surprisingly, a decoy effect can occur even if the decoy is not actually an 
option, but merely a recent memory of an option (a phantom decoy BE])- 
Furthermore, the decoy effect is not limited to humans, but is also observed in 
honeybees and grey jays m- 

In an attempt to explain the decoy effect, experts in the behavioral sciences 
have offered a variety of domain-specific analyses, including “perceptual fram¬ 
ing” [13], “value-shift” [26], “extremeness aversion” [22], and “contrast bias” 
[nm]. All of these explanations focus on valuing the discrepancy between the 
decoy and the dominating alternative. Intuitively, this provides a compelling 
example of preferring the margin of safety from the worst outcome. As we are 
not aware of any formalization in decision theory that is consistent with the 
intuitive preference for “margin of safety”, we offer one here. 
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To illustrate our new decision rule, recall the example 
of the unfortunate duck hunters. As they run from the 
bear, they approach a blind curve and have no idea what 
is around it: it could be wet or dry. If it is dry they will 
cover the most ground if they try to run faster, however if 
it is wet (thus slippery) they will be better off if they slow 
down and maintain balance. The options and the distance 
traveled under each circumstance are summarized in Ta¬ 
ble [H In general, exerting excessive effort on a wet road 
leads to slipping and less distance covered; exerting effort 
on a dry road leads to more distance covered. 

If the probability of the road conditions is unknown, and only the first two 
options are available (sprint and hustle), there is no intuitively preferred choice 
and we may assume there are enough hunters such that at least one will pick each 
option. However, if we add a new option, jog, something interesting happens. As 
jog is dominated by hustle, IDA requires that its availability should not change 
the preferences among the other options. However, regardless of whether the 
road is wet or dry, hustle is never the worst alternative: if the road is wet, hustle 
(3) is faster than sprint (1), and if the road is dry, hustle (6) is faster than jog 
(2). In either case, selecting hustle prevents the hunter from being the slowest 
and getting caught by the bear. 

While it may be callous, it seems perfectly reasonable for a hunter to decide 
to run just fast enough to make sure there is someone behind him. In other 
words, the most sensible decision might be to run just fast enough to guarantee 
the maximum possible margin between himself and the slowest runner, in the 
worst scenario. This margin between the hunter and his slowest compatriot can 
be considered a measure of safety, which is at the heart of our paper. 

The rest of the paper proceeds as follows. Section provides a formaliza¬ 
tion of the decoy paradox along with basic decision-theoretical notation. Sec¬ 
tion [3] describes the relationship between minimax regret and maximin safety 
and shows how maximin safety resolves the decoy paradox. Section |4] provide 
an axiomatic characterization of maximin safety. Section [S] suggests a unifica¬ 
tion of utility, regret, and safety using anchoring functions, and also considers 
a generalization to qualitative relative preferences. 



Wet 

Road 

Dry 

Road 

Sprint 

1 

9 

Hustle 

3 

6 

Jog 

2 

2 


Table I: Hunters 
running from a 
bear. 


2 The Formal Framework 

Given a set S of states and a set X of outcomes, an act a (over S and A) is a 
function mapping S to X. The set of all acts is thus X^, which we will denote 
by A. For simplicity in this paper, we take S to be finite. Associated with each 
outcome a: G A is a utility: U(x) is the utility of outcome x. For convenience, 
we will omit the explicit representation of the outcome, and denote U(a(s)) by 
U{a, s) for each state s G S. We call a tuple {S, X, U) a (non-probabilistic) 
decision problem. To define regret and safety, we need to assume that we are 
also given a set M C A of feasible acts, called the menu. The reason for the 
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menu is that, as we have shown, regret and safety can depend on the menu. 
We will only consider finite menus, from which randomized strategies can be 
chosen. 

Consider the problem of 
a decision maker (DM) con¬ 
templating a camera purchase, 
summarized in Table m The 
DM has a choice between buy¬ 
ing a rugged travel camera (oi) 
that takes decent pictures in a 
wide variety of circumstances, 
and buying a delicate sports 
camera with higher speed and 
image quality ( 02 )- Each state characterizes the possible situations that a pur¬ 
chaser may experience during the useful life of the camera (Will the DM expe¬ 
rience harsh conditions? Or win tickets to the World Cup?) The utility U{a, s) 
of act a under state s represents an abstract net value to the DM if the true 
world is state s. 

If the DM ends up going on a safari (si), then act oi results in moderate 
quality pictures of exciting wildlife (17(ai,Si) = 4), but act 02 results in a 
few exquisite shots and many missed opportunities (C/(a 2 ,si) = 2). On the 
other hand, if the DM goes to the World Cup ( 52 ), then act 02 results in many 
great pictures in a safe environment {U(a 2 , S 2 ) = 6), while act ai provides only 
moderate quality pictures ({7(ai,S2) = 4). 

If the DM can assign probabilities T’(si) and P{s 2 ) to the states, she can 
calculate an expected utility E[U{ai)] = X^seS®)’ simply select 
the act that maximizes expected utility. However, if the state probabilities are 
unavailable, we have unquantified uncertainty. In such cases, the DM must find 
another method for aggregating the utility of each act across states in order to 
assign a value to each camera. Here we will focus on the methods of maximax 
utility, minimax utility, and minimax regret. To understand minimax regret, 
we need to define the notion of regret. For a menu M and act a € M, the regret 
of a with respect to M and decision problem {S, X, U) is 

max(max U(a' s) — U(a, s)). 

(S JC U) 

We denote this as reg)^' ’ '(a), and usually omit the superscript (S,X, U). 

When comparing decision rules, it is often convenient to define a value func¬ 
tion that assigns a numeric value to each act, for the purpose of ranking the 
acts. Formally, for a decision problem (5", X, [/), a value function is a function 

ViS,x,u) M):X^x2^^ K. 

We will usually omit the superscript {S,X, U) and just write V{a,M), or V{a) 
if the value function is menu-independent. 



Si 

Safari 

S2 

World Cup 

oi: Travel 

4 

4 

02 - Sports 

2 

6 

03 : Decoy 

3 

3 


Table 2: Utilities in the camera purchase ex¬ 
ample. 
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Decision Rule 

Value of an act a 

Decision rule description 

Best 

maximax utility 

V(a) = maxggs U (a, s) 

Optimize the best-case outcome. 

02 

maximin utility 

V{a) = minses t/(a, s) 

Optimize the worst-case outcome. 

Oi 

minimax regret 

V{a,M) = -regMia) 

Pick an act to minimize the worst- 
case distance from the best outcome. 

Oi, 02 


Table 3: Standard decision rules and most valued acts in the camera example. 

We say that the value function V represents the family of preference relations 
if for all menus M and all a, a' G M, 

o '?~v,M a V (a, M) > V{a , M). 

In other words, act a is (strictly) preferred to act a' with respect to menu M if 
and only if V{a, M) > V{a', M). The value functions and preferences of several 
standard decision rules are given in Table [3] 

Now, perhaps the camera vendor would like to sell more travel cameras, so 
the vender puts an obsolete travel camera 03 next to oi as a decoy. Camera 
03 has the same price as oi, but fewer features and lower picture quality. The 
vendor hopes to make oi more appealing by contrast with 03 . Table[2]illustrates 
the decision problem when 03 is added to the menu. The ranking between 
a I and 02 according to each of the decision rules in Table [3] is unaffected by 
the introduction of 03 to the menu. The addition of 03 also illustrates the 
concept of dominance. We say that an act a dominates a', if for all s € S, 
U{a,s) > U{a',s). 

3 Maximin Safety 

While minimax regret seeks to minimize separation from best outcomes, max¬ 
imin safety is a conceptual dual that seeks to maximize separation from the 
worst outcomes. For a menu M and act a G M, the safety of a in state s is 
defined as: 

safety^^’^’^\a, s) = U{a,s) - min ([/(o', s)), 

a'^M 

and in keeping with the convention for regret, the safety of an act is defined as: 
safety^M^’^\a) = mm{safety^M^’^\a, s)). 

We will often omit the superscript {S,X, U). 

The family of maximin safety preferences >-saf,M represented by the safety 
value function satisfies, for all M and a, a' G M, 

a >-saf,M a' O safety^{a) > safety 

Now we reconsider the camera example using safety (Table |4]) . Without 
the decoy, both acts have the same safety of 0 , since each act has the lowest 
utility in some state; so there is no clear safety preference. However, when the 
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Utility 

Safety 

(no 

decoy) 

Safety 

(w. 

decoy) 


Si 

S2 

Si 

S2 

Si 

S2 

oi: travel 

4 

4 

2 

0 

2 

1 

02: sports 

2 

6 

0 

2 

0 

3 

03: decoy 

3 

3 


1 

0 


Table 4: Camera purchase with and without decoy. 



Utility 

Regret 

Safety 

Optimal for 


Si 

S2 

Si 

S2 

Si 

S2 


Oi 

1 

9 

3 

0 

0 

5 

maximax utility 

02 

3 

6 

1 

3 

2 

2 

maximin safety 

03 

2 

7 

2 

2 

1 

3 

minimax regret 

(X 4 

4 

4 

0 

5 

3 

0 

maximin utility 


Table 5: Different decision rules select different acts for the same problem. 


decoy is present, the act oi never has the lowest utility at any state, and thus 
it has a strictly positive safety. In this case, a3}(ai) is the unique 

maximum among the acts {oi, 02 , 03 }, and therefore oi is the preferred choice. 
The relative increase in the safety of an act due to the addition of the dominated 
act is an essential element in solving the decoy paradox. Intuitively, this may 
correspond to a sense that even if a particular act gets low utility in the realized 
state, the DM may think that “I’m better off than the fools who bought the 
worse camera”, or in a more positive light, “I must be getting a steal with this 
better camera for the same price”. In competitive survival games (such as the 
reality game show Survivor), the notion of maximizing safety may also embody 
a preference to maintain a maximal distance from the lowest performer, which 
reduces the chance of elimination. Table [5] compactly demonstrates how choices 
based on maximin safety differs from the other standard decision rules. 

In the camera exam¬ 
ple, the addition of a de¬ 
coy created a strict pref¬ 
erence between two acts 
that were initially tied. 

The introduction of a 
dominated act can actu¬ 
ally reverse preferences 
between acts. Table El 

shows a menu of three ^ r i j 

, ,^r 1 fable d: Without 04 , M = 101 , 02 , 03 !, and 

acts: M = {01,02,03}. . j u \ 

Act 02 has a minimum a,. Adding 04 (dominated by 03 ) reverses 

the maximin saiety prererence between a 2 and a^. 



utility 

Safety(no 

decoy) 

Safety (w. 
decoy o4) 


Si 

S2 

S 3 

Si 

S2 

S 3 

Si 

S2 

S 3 

Ol 

9 

2 

6 

5 

0 

0 

8 

0 

0 

02 

5 

3 

7 

1 

1 

1 

4 

1 

I 

03 

4 

8 

8 

0 

6 

2 

3 

6 

2 

04 

1 

5 

6 


0 

3 

0 
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safety of 1, while both ai 
and 03 have the lowest 

utility for some state, so each has minimum safety of 0. Consequently, 02 is 
the most preferred choice under the safety preference. When a new choice 04 
is added, act 04 is dominated by 03, but it has higher utility than the other 
acts in some states. This situation is known as asymmetric dominance^ which 
is typically associated with decoy effects. In this example, asymmetric dom¬ 
inance guarantees that 03 is never one of the worst choices, and thus has a 
strictly positive safety value. In other words, the addition of 04 to the menu 
M does not affect the safety of oi or 02, but increases the safety of 03 to make 

^3 saf ,M\j{a 4 } ^2 • 


4 Axiomatic Analysis 

To provide an axiomatic characterization of maximin safety, we employ the 
standard Anscombe-Aumann (AA) framework [T], where outcomes are restricted 
to lotteries. Maximin safety is characterized by modifying one of the axioms in 
an existing characterization of minimax regret provided by Stoye |24j . 

Given a set Y of prizes^ a lottery over Y is just a probability with finite 
support on y. As in the AA framework, we let the set of outcomes be A(y), 
the set of all lotteries over Y. Thus, acts are functions from S to A(y). We 
can think of a lottery as modeling objective, quantified uncertainty, while the 
states model unquantified uncertainty. The technical advantage of considering 
such a set of outcomes is that we can consider convex combinations of acts. If 
/ and g are acts, define the act af + {1 — a)g to be the act that maps a state 
s to the lottery af{s) + (I — a)g{s). For simplicity, we follow Stoye [23] and 
restrict to menus that are the convex hull of a finite number of acts, so that if 
/ and g are acts in M, then so is pf + {1 — p)g for all p G [0,1]. 

In this setting, we assume that there is a utility function U on prizes in Y, 
and that there are at least two prizes yi and ?/2 in Y^ with different utilities. 
Note that l{y) is the probability of getting prize y if lottery I is played. We 
will use I* to denote a constant act that maps all states to 1. The utility of 
a lottery I is just the expected utility of the prizes obtained, that is, u{l) = 
J2{y^Y ■ i{y)>o} Ky)Uiy)- The expected utility of an act / with respect to a 
probability Pr is then just u{f) = X^sgS usual. Given a set 

Y of prizes, a utility U on the prizes, and a state space S', we have a family 
preference orders on acts determined by maximin safety, where u 
is the utility function on lotteries as determined by uE For convenience, from 
here on we will write rather than We will state the axioms 

in a way such that they can be compared to standard axioms and those for 
minimax regret in |24] . The axioms are universally quantified over acts /, g, 
and h, menus M and M' , and p G (0,1). Whenever we write / g we assume 
that f,g G M. 


2 We let / g iS g /, and / ~m 9 iff / bju 9 and g Gm /■ 



Axiom 1. (Monotonicity) f >m 9 */(/(«))* ^{(/(s))*,(s(s))*} ( 5 (s))*)Vs e S . 
Axiom 2. (Completeness) f 9 or g /• 

Axiom 3. (Nontriviality) f >-m 9 for some acts f and g and menu M. 

Axiom 4. (Mixture Continuity) If f )^m 9 >-m h, then there exists q,r € (0,1) 
such that qf + {1 — q)h >-m 9 >M rf + {1 — r)h. 

Axiom 5. (Transitivity) f 9 ciM f hm h. 

Menu-independent versions of Axioms [T] to [5] are standard in other axiom- 
atizations, and in particular hold for maximin utility. Axiom |3] is used in the 
standard axiomatizations to get a nonconstant utility function in the representa¬ 
tion. While maximin safety does not satisfy menu-independence, it does satisfy 
menu-independence when restricted to menus consisting of only constant acts. 
This property is captured by the following axiom. 

Axiom 6. (Menu independence for constant acts) If I* and {I')* are constant 
acts, then I* >m (I')* iff I* ^m' {I')* ■ 

We also have a menu-dependent version of the von Neumann-Morgenstern 
(VNM) Independence axiom. Like the VNM Independence axiom, Axiom 0 
says that ranking between two acts does not change when both acts are mixed 
with a third act; but unlike VNM Independence, the menu used to compare the 
original acts in Axiom [7] is different from that used to compare the mixtures. 
Axiom [3 holds for minimax regret and maximin safety, but not for maximin 
utility. 

Axiom 7. (Independence) f 9 pf + — p)h 'tZpM+(i-p)h + (1 “ p)h. 

Axiom 8. (Symmetry) For a menu M, suppose that E, F € 2'®\{0} are disjoint 
events such that for all f G M, f is constant on E and on F. Define f by 

{ f{s') for some s' £ E, if s £ E 
f{s') for some s' £ E, if s £ E 
f{s) otherwise 

Let M' be the menu generated by replacing every act f £ M with f'. Then 


f hM 9 f 'Cm' g' 


Symmetry, which is one of the characterizing axioms for minimax regret 
in |24) . captures the intuition that no state can be considered more or less 
likely than another. Therefore Symmetry helps distinguish the probability-free 
decision rules maximin utility, minimax regret, and maximin safety, from their 
probabilistic counterparts mm- 

Axiom 9. (Ambiguity Aversion) f 5 p/ + (1 ~ p)9 9- 
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Axiom 0 says that the decision maker weakly prefers to hedge her bets. 
Axioms l-l are all part of the characterization in |24) of minimax regret (which 
consists of Axioms l-lHl and Symmetry). Axioms 1-5 and [5] are also sound for 
the maximin decision rule [24) . 

In [24] , one of the axioms characterizing minimax regret is Independence of 
Never Strictly Optimal alternatives (INA), which states that adding or removing 
acts that are not strictly potentially optimal in the menu does not affect the 
ordering of acts. @ By varying this INA axiom, we obtain a characterization for 
maximin safety. We say that an act a is never strictly worst relative to M if, 
for all states s £ S, there is some a' £ M such that a{s) ^ a'(s). 

Axiom 10. (Independence of Never Strictly Worst Alternatives (INWA)) If an 
act a is never strictly worst relative to M, then f iff / hMu{a} 9- 

Although adding acts to the menu, in general, can affect minimax regret 
preferences, INA implies the Independence of Dominated Alternatives property 
that we used earlier when discussing the decoy effect. Thus, INA guarantees 
that minimax regret can never be compatible with the decoy effect. 

Theorem 1. For all Y,U,S, the family of maximin safety preference orders 
—s^M ‘iiT'duced by a decision problem {S, A(Y),u) satisfies AxiomsHUKA Con¬ 
versely, if the family of preferenee orders on the acts in A(F)'^ satisfies 
Axioms MM then there exists a utility function U on Y that determines a 
utility u on A(Y) such that Moreover, U is unique up to affine 

transformations. 

Proof. The soundness of the axioms are straightforwardly verified, so we show 
only the completeness of the axioms. We will use the same general sequence 
of arguments that Stoye uses in [24]. First, we establish a nonconstant utility 
function U, where constant acts are ranked by their expected utilities. Since we 
have the standard axioms (1 — 5), we get U from standard arguments, and it is 
unique up to affine transformations. Next, we observe the following lemma: 

Lemma 1. Suppose the family Am satisfies Axioms 1-10, and Am+ is repre¬ 
sentable by maximin safety, where M~^ is the menu of all acts with nonnegative 
utilities. Then the family Ym is representable by maximin safety. 

Lemma |T] follows from an argument analogous to that for regret in [23] . The 
next step is to establish that the axioms on Am restrict Am+ to satisfy the 
axioms of ambiguity aversion, monotonicity, completeness, transitivity, non¬ 
triviality, and symmetry. It is a straightforward verification that will not be 
reproduced here. Theorem 1 (hi) of [24] then implies that Am+ is the maximin 
utility ordering. Next, let gn be an act such that uogM{s) = — min/jgM u{h, s), 
so that we have 

® An act h is never strictly optimal relative to M if, for all states s £ S, there is some 
f £ M such that (/(s))* A [hiO* ■ 
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f hM 9 If + \gM ^M+ \9 + \9M 

miiisgs u{\f + \gM, s) > min^gs u{\g + \gM, s) 

^ min(-(M(/,s) - minM(/i,s))) > min(-(u(g, s) - minM(/i, s))). 
ses 2 heM ses 2 heM 

□ □ 

The characterizing axioms serve as a justification for maximin safety in the 
sense that behaving as a safety maximizer is equivalent to accepting the axioms. 
Axioms 1-7 are standard and broadly accepted to be reasonable, while symme¬ 
try and ambiguity aversion are implied by both maximin utility and minimax 
regret. Whether the INA axiom (for regret) or the INWA axiom (for safety) 
is more reasonable would depend on the individual and the nature of the deci¬ 
sion problem. Thus, we believe that the reasonableness of the maximin safety 
decision rule is comparable to that of minimax regret. 

Individual necessity of the axioms can be established, as is commonly done 
[TTl , by giving examples of preferences that satisfy all the axioms except for 
the one whose necessity is being shown. For the axioms shared with minimax 
regret, the same examples found in |24j shows their individual necessity. For 
the INWA axiom, the required example is minimax regret. Indeed, a decision 
rule equivalent to maximin safety was used by Hayashi m as an example to 
justify minimax regret’s entailment of INA. 

Clearly, just as minimax regret is readily generalized to minimax expected 
regret when uncertainty is represented by a set of probability distributions over 
the state space, maximin safety can be readily extended to maximin expected 
safety in the same manner. As one would expect, given an axiomatization of 
minimax expected regret [23], the modification of the INA axiom to INWA 
results in an axiomatization for maximin expeeted safety. 


5 Discussion, Generalizations, and Future Work 

Both minimax regret and maximin safety embody preferences based on relative, 
rather than absolute utility. In Table jS] the act preferred by safety has a lower 
minimum utility than the act preferred by maximin utility, just as the act picked 
by minimax regret neglects a higher maximum utility in order to minimize the 
margin to each state’s maximum utility. The shared preference for relative over 
absolute performance is reflected in a striking similarity in the structure of the 
value functions for regret and safety. In comparison, minimax regret can be 
expressed for all acts a,b as: 

a '^reg M b iS min(C7(a, s) — maxt7(a', s)) > min(C/(6, s) — maxC/(a', s)). 

’ s^S a'^M s^S a'^M 

Similarly, maximin safety is represented for all acts a, b as 
a >“so/,M & iff min([/(a, s)— min C7(a', s)) > min(C/(6, s) — min [/(o', s)). 

S^S CL^ S^S CL^ 


II 


The structural resemblance suggests a common form for the value function. By 
defining a menu-dependent anchoring function < : S' x 2"^ —^ R, we can represent 
several previously discussed value functions as: 

Vt{a,M) = minsgsC7'(a, s,M, t), 

where U'{a, s, M, t) = U(a, s) — t{s, M) can be viewed as an anchored effective 
utility. One can see that Vt represents maximin utility if t(s, M) = 0; min¬ 
imax regret if t(s,M) = max^/gM s); and maximin safety if t(s,M) = 
U{a', s). Note that by varying just the anchoring function t, we can 
obtain all the mentioned decision rules, and more. While we focus only on max¬ 
imin safety in this paper, other forms for t(s, M) maximize the positive margin 
from a state-dependent average, median, or some other characteristic of interest 
to a DM. For example, college students might seek to conservatively maximize 
their margin above a desired quantile, in order to achieve a particular grade. 

The present work is motivated by behavioral observations of the decoy effect 
that are typically described in empirical quantities such as distance, price and 
volume, and thus is most intuitive in a quantitative framework. However, the key 
observation is that safety, like regret, is a notion of relative performance with 
respect to a set of outcomes, rather than absolute performance. As absolute 
quantitative utility C7 : A —^ R can be generalized to a qualitative framework 
by replacing the U with a mapping A —^ L for some ordered set L, relative utility 
may be made qualitative by considering the mapping with 2^ x A —>■ L. In the 
case of safety and regret, the particular element of 2^ is the set of all possible 
outcomes in a state, given a menu of acts. Aggregation of N state-specific 
orderings into an ordering over acts can be accomplished by an aggregation 
function M : —>• L [TH]. This generalization can be readily applied to 

various characterizations of uncertainty, including probability, plausibility, and 
the strict uncertainty used in this paper m- While the authors expect that the 
present quantitative axiomatization can be adapted to a qualitative framework 
(see, e.g. [7]), it is beyond the scope of the current paper. 
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